Python Job: Site Reliability Engineer (London)

Job added on

Location

London, England - United Kingdom

Job type

Full-Time

Python Job Details

Join the dynamic Site Reliability Engineering teams across various domains and locations. As an SRE, you will play a crucial role in ensuring the high performance, reliability, and security of our systems. Each team focuses on different aspects of our infrastructure.

 

Please note: This is an on-site role for London, UK

Team:

As a Staff SRE Engineer on the Coordination team, you will:

  • Oversee the operation of software and services in the Coordination team.
  • Leverage deep expertise in networking, routing, and traffic patterns.
  • Leverage deep expertise in service discovery, distributed services, and cloud infrastructure.
  • Develop monitoring, alerting, and incident response solutions.
  • Contribute to ongoing enhancement efforts and champion reliability engineering best practices.

Who You Are:

  • Highly motivated team player with initiative.
  • Strong debugging, documentation, and communication skills.
  • Ability to work collaboratively in a dynamic environment.
  • Availability for occasional travel (up to 20%).

Qualifications:

  • Bachelor's degree or above in Computer Science, Engineering, or related field.
  • 5+ to 10+ years of experience in site reliability engineering or related roles.
  • Expertise in relevant technologies, such as CDN operations, containerization, incident management, traffic routing, and distributed systems.
  • Proficiency in scripting and automation (Python, Perl, Go).
  • Strong knowledge of Unix/Linux system administration at scale.